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In eukaryotic cells, genomic DNA replicates in a defined temporal order. The inactive X chromosome [Xi), the most 
extensive instance of facultative heterochromatin in mammals, replicates later than the active X chromosome [Xa), but the 
replication dynamics of inactive chromatin are not known. By profiling human DNA replication in an allele-specific, 
chromosomally phased manner, we determined for the first time the replication timing along the active and inactive 
chromosomes [Xa and Xi) separately. Replication of the Xi was different from that of the Xa, varied among individuals, 
and resembled a random, unstructured process. The Xi replicated rapidly and at a time largely separable from that of the 
euchromatic genome. Late-replicating, transcriptionally inactive regions on the autosomes also replicated in an un- 
structured manner, similar to the Xi. We conclude that DNA replication follows two strategies: slow, ordered replication 
associated with transcriptional activity, and rapid, random replication of silent chromatin. The two strategies coexist in 
the same cell, yet are segregated in space and time. 



[Supplemental material is available for this article.] 

DNA synthesis in eukaryotic cells initiates from many origins of 
replication. Only a small subset of the available origins is used 
within a given cell cycle, and the specific origins that are utilized 
differ from cell to cell. Nevertheless, at large scales, genomic loci 
replicate in a reproducible temporal order that is thought to result 
from variations along the genome in the rates and preferred acti- 
vation timing of replication origins. A notable exception to pro- 
grammed genome replication occurs in early embryonic devel- 
opment in Xenopus laevis and Drosophila melanogaster. During 
cleavage cell divisions, when there is no transcriptional activity, 
DNA replication initiates from closely spaced loci that are located 
randomly with respect to the DNA sequence, and is completed in 
a very short time (several minutes, compared with hours in adult 
cells) (Hyrien and Mechali 1993; Sasaki et al. 1999). DNA replica- 
tion becomes organized, with replication origins activated at spe- 
cific times and locations, only at the mid-blastula transition, 
concomitant with the onset of zygotic transcription (Hyrien et al. 
1995; Sasaki et al. 1999). The observation that structured replica- 
tion is established just as transcription commences suggests that 
a strict replication program may be required for the regulation of 
gene activity. However, random replication has not been observed 
in mammals or outside the context of embryonic development. 

In mammals, developmentally regulated, chromosome-wide 
transcriptional silencing occurs in the process of X chromosome 
inactivation (XCI). XCI is a dosage compensation mechanism in 
which one of the two X chromosome copies in females is tran- 
scriptionally inactivated (Lyon 1961). The inactive chromosome is 
randomly chosen and subsequently clonally maintained through 
an epigenetic mechanism that involves coating of the chromo- 
some by the noncoding RNA XIST (X-inactivation specific tran- 
script), DNA hypermethylation, histone modifications, and other 
chromatin marks (for review, see Lee 2011). The Xi localizes to the 
perinucleolar compartment during mid-to-late S phase (Zhang 
et al. 2007) and adopts a three-dimensional conformation that 
was recently shown to be random in mice (Splinter et al. 201 1). The 
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Xi replicates later in S phase than the Xa (Gilbert et al. 1962; 
Morishima et al. 1962; Hansen et al. 1996, 2010; Chadwick and 
Willard 2003; Heard and Disteche 2006). Whether the Xi follows 
a defined replication timing program remains unknown, although 
early microscopy studies suggested that its replication differs from 
that of the Xa (Gilbert et al. 1962; Morishima et al. 1962; Willard 
and Latt 1976; Schempp and Meer 1983). 

Results 

We recently described the profiling of genomic DNA replication in 
lymphoblastoid cell lines (LCLs) from two father-mother-daugh- 
ter trios by sequencing DNA from FACS-sorted Gl and S phase 
cells. Replication timing was inferred from the fluctuations in the 
abundance (read depth) of DNA sequences along chromosomes: 
The earlier a locus replicates, the higher its abundance in genomic 
DNA in S phase cells (Koren et al. 2012). We next sought to de- 
termine the replication profiles of each of the 46 chromosomes 
individually. We identified all heterozygous SNPs (median spacing 
of —800 bp) from the genome sequence of these cell lines (The 
1000 Genomes Project Consortium 2010) and assigned sequence 
reads to individual chromosomal copies by using inheritance in 
the trios to determine chromosomal phase for each SNP allele 
(Methods). We thus obtained replication profiles for each of the 46 
chromosomes, and were further able to distinguish the active from 
the inactive X chromosome (Methods; Supplemental Fig. SI). 

The timing of Xi replication was mostly distinct from that of 
all other chromosomes: 75% of the Xi commenced replication 
after almost 75% of the rest of the genome had already completed 
replication (Fig. 1A). Intriguingly, despite being the last chromo- 
some to initiate replication and having the latest average replica- 
tion time, the Xi still completed replication no later than the rest of 
the genome (Fig. 1A), suggesting that the Xi replicates more rapidly 
than other chromosomes do. A direct analysis revealed that the 
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Figure 1. Replication dynamics of the 46 human chromosomes. (A) Distribution of replication timing for each of the 46 chromosomes in lympho- 
blastoid cell line (LCL) NA1 9240. For each chromosome, thin vertical lines show the distribution of replication times; thick bars show the 25th and 75th 
percentiles; and the horizontal line shows the median. (Gray) Maternally inherited chromosomes; (black) paternally inherited chromosomes; (green) the 
active X chromosome (Xa); (blue) the inactive X chromosome (Xi). Dashed horizontal line is the Xi 25th percentile. The Xi replicated at a time separable 
from most of the genome. Results for LCLs from the other two females were similar (data not shown). (B) Replication speed (the inverse of the replication 
time span) of the bulk (interquartile range) of each chromosome in cell line NA19240. The dashed horizontal line represents the autosomal average. 
Results for LCLs from the other two females were similar (data not shown). Note that the high-GC content chromosome 22 is not shown since its values 
were unreliable due to relatively poorer data quality. (C) Smoothed chromosome 7 replication profiles of LCLs derived from three females (and one 
experimental replicate; green and blue) and one male (cyan). The replication profile was similar among individuals and between homologous chro- 
mosomes. (Mat) maternal; (Pat) paternal; (1,2) homologous chromosome copies for which parent-of-origin is unknown (Supplemental Fig. SI); [(2)] 
experimental replicate. Replication time is normalized as a Z-score (Koren et al. 201 2). Allelic similarity is the similarity in replication timing between the 
homologous chromosome copies; correlation score represents the similarity of replication pattern between the homologs (Methods). (Gray vertical lines) 
Centromere. Results for other autosomes were similar (data not shown). (D) Smoothed replication profiles of chromosome X, showing the delayed, 
unstructured, and variable replication timing of theXi relative to the consistent and structured Xa. The X (cyan) and Y (red) chromosomes of two males are 
shown; the correlation between the two Y chromosomes was r = 0.89. Also shown are the tendency of genes to escape X inactivation, in fractional units, 
and the X chromosome evolutionary strata. (P) Pseudoautosomal region; (S) stratum. PARI and strata 4 and 5 showed the typical autosomal signature of 
allelic similarity. No other regions on the Xi appeared to have any significant replication structure at the sensitivity level of detection of our method (—0.5 
standard units of replication timing, corresponding to —10% of the replication time span — see panel C). XIST replication timing is considered more 
specifically in Supplemental Figure S3. See Supplemental Figure S2 for more detailed images of X chromosome replication, including replication profiles 
obtained with higher coverage data for specific regions of the chromosome. 



bulk of Xi replication (defined by the interquartile range of repli- 
cation timing) completed approximately twice as fast as any other 
chromosome, and almost three times faster than the bulk of Xa 
replication (Fig. IB). 

The replication profiles of the autosomes were visually in- 
distinguishable between homologous chromosome copies, be- 
tween different individuals, and between experimental replicates 
(Fig. 1C; Supplemental Fig. S2). We verified this statistically using 
a correlation analysis, which revealed highly correlated replication 
timing in each of these comparisons (mean r = 0.85) (Fig. 2A). 
Analysis of the X chromosome, however, revealed a strikingly 
different pattern. Xa replication profiles were similar among fe- 
males and consistent between experimental replicates (Figs. ID, 
2A); they were also similar to the replication profiles of the X 
chromosome in males (mean r = 0.85). In contrast, replication of 
the inactive X chromosomes was far less similar (mean r = 0.39) to 
that of the active copies, with only a few zones of limited similarity 
(Figs. ID, 2A). The Xi replication patterns also differed among the 
three females (mean r = 0.37) and between experimental repeats 
(r = 0.39, compared with r = 0.79 for the Xa). Furthermore, the 
"diploid" X chromosome profiles (which represent a composite of 



the two chromosomal copies not limited by SNPs) closely re- 
sembled the Xa but not the Xi profiles. Taken together, these re- 
sults suggest that the Xi does not replicate according to a spatially 
structured program. 

To more formally test for a replication structure on the Xi, we 
used autocorrelation analysis to evaluate the correlation between 
the replication timing of distinct sites as a function of the physical 
distance between them (Methods). We separately analyzed auto- 
correlation of read depth in S phase cells, which represents the 
continuous process of DNA replication, and in Gl cells, a control 
that will manifest any technical fluctuations in sequence read 
depth (Koren et al. 2012). As expected, Gl phase cells exhibited 
only a slight positive autocorrelation (Fig. 2B), explained by the 
known effect of GC content on read depth. On the other hand, 
S phase DNA abundance across the autosomes and the Xa ex- 
hibited a strong autocorrelation for distances of several hundred 
kilobases, indicating a spatially structured program (Fig. 2B). 
Strikingly, in all three females, S phase DNA abundance of the Xi 
had autocorrelation that was no stronger than that of Gl DNA, 
indicating that the Xi replication pattern resembles a random 
process (Fig. 2B). 



Genome Research 65 



www.genome.org 



Koren and McCarroII 



Chr7 



Chr8 



Chr9 



ChrX 



NA19240 Mat 
NA12878 Mat 
NA12892 1 
NA19240 Mat (2 
NA19240 Pat 
NA12878 Pat 
NA12892 2 
NA19240 Pat (2) 
NA19240 Dip 
NA12878 Dip 
NA12892 Dip 
NA19240 Dip (2) 




B 



Chr7 



Chr8 



Chr9 



ChrX 



NA19240 




0 200 400 600 

Distance (Kb) 



0 200 400 600 

Distance (Kb) 



0 200 400 600 

Distance (Kb) 



0 200 400 600 

Distance (Kb) 



Figure 2. Structure and correlations of chromosomal replication profiles. (A) Correlation of replication timing for pairs of homologous chromosomes, 
within and between individuals and experimental replicates, shown as correlation matrices. For the autosomes, replication profiles were similar among 
individuals and between chromosome homologs in the same individual. Replication of the Xa was also consistent among individuals; however, replication 
of the Xi was different from the Xa, was uncorrelated among individuals, and was inconsistent between experimental replicates. (Mat) Maternal; (Pat) 
paternal; (1,2) homologous chromosome copies for which parent-of-origin is unknown (Supplemental Fig. SI); (Dip) diploid (aggregated across the two 
homologous chromosome copies); [(2)] experimental replicate. (B) Autocorrelations of DNA replication timing. The highly structured replication of the 
autosomes and the Xa is visible as long-range autocorrelation of DNA copy number in S phase genomes (but not in control, G1 phase genomes). In 
contrast, the inactive X chromosome shows no more autocorrelation in S phase than in G1 phase cells. Results for NA1 2892 were similar (data not shown). 
For both A and B, the 8-Mb left-distal part of the X chromosome was excluded from the analyses (see text), and results for other autosomes were similar 
(data not shown). 



We then evaluated the extent of replication randomness 
along the X chromosome and among different evolutionary do- 
mains of the X chromosome. Mammalian sex chromosomes 
evolved from a pair of autosomes through a series of inversions on 
the Y chromosome that led to loss of recombination and sub- 
sequent sequence divergence between the X and Y chromosomes. 
Five discrete evolutionary strata of progressive X-Y evolutionary 
divergence have been observed. Sequence homology and re- 
combination activity are retained at the 2.7-Mb-long pseudoau- 
tosomal region 1 (PARI) and the 0.3-Mb-long PAR2 on the left and 
right ends, respectively, of the X and Y chromosomes (Fig. ID; Ross 
et al. 2005). 

We analyzed the average difference in replication timing 
and the consistency of replication pattern between the two copies 
of each chromosome along their entire lengths. Autosomes ex- 
hibited a characteristic signature of consistent replication structure 
and timing for each pair of chromosome homologs (Fig. 1C). In 
contrast, the X chromosome exhibited low allelic similarity and 
correlation throughout its length, consistent with a ds-effect on 
replication timing and structure (Fig. ID). The notable exception 
was the ~8-Mb region on the distal short arm of the X chromo- 
some, which contains PARI and evolutionary strata 4 and 5. This 
region replicated relatively early and exhibited replication patterns 
typical of autosomes, including similar replication timing and 



structure for the homologous chromosome pairs (Xa and Xi) 
(Fig. ID; Supplemental Figs. S2, S3). Remarkably, the replication 
pattern of the corresponding 8-Mb area on the left end of the Y 
chromosome also showed a similar replication pattern, even 
though X-Y syntenic homology extends only to the PARI region 
(Fig. ID; Supplemental Fig. S2). We conclude that an ~6-Mb region 
beyond PARI replicates in an ordered and coordinated way on the 
Xa, Xi, and Y chromosomes. Intriguingly, the same region also 
contains a cluster of genes that escape X inactivation and are 
expressed from both the Xa and Xi (Fig. ID; Carrel and Willard 
2005). This area is thought to have been pseudoautosomal in re- 
cent evolutionary history (Ross et al. 2005), and we propose that it 
is still effectively pseudoautosomal from an epigenetic point of 
view, i.e., that it retains similar epigenetic determinants of repli- 
cation timing on the X and Y chromosomes and is not subject to 
dosage compensation on the X chromosome. 

The observed lack of replication structure on Xi could arise 
from features specific to X inactivation, or could represent a gen- 
eral property of late-replicating chromatin on all chromosomes. 
To distinguish between these possibilities, we partitioned auto- 
somal genomic segments into eight groups based on their repli- 
cation timing and analyzed the autocorrelation along genomic 
segments in each group (Fig. 3). Autocorrelation of replication 
timing decreased as S phase progressed, indicating a gradual loss of 
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Figure 3. Random, unstructured replication of late-replicating, tran- 
scriptionally inactive genomic segments on the autosomes. Autosomal 
segments were divided into eight groups based on their replication tim- 
ing. Shown for each group are the autocorrelation of replication timing at 
a ~2-kb scale (averaged over all six individuals profiled; Methods), the 
number of RefSeq genes, average level of gene expression (measured by 
number of RNA-seq reads), and density of DNase I hypersensitive sites 
(DHS) representing the 5% most open chromatin regions in lympho- 
blastoid cell lines. Error bars, SE. All data are scaled relative to the maximal 
values for each data type. 

replication structure. Furthermore, autocorrelation decreased along 
with gene density, transcription levels, and the proportion of open 
chromatin in the genomic segments being replicated (Fig. 3). Thus, 
random replication is a general property of late-replicating, tran- 
scriptionally inactive chromatin, and cells transition from slow, 
organized replication to fast, random replication as S phase prog- 
resses. The increase in replication randomness as S phase progressed 
was sufficient to account for most, if not all, of the randomness of 
the Xi (Supplemental Fig. S4). 

Discussion 

In this study, we have determined for the first time the detailed 
replication dynamics of the human active and inactive X chro- 
mosomes. Consistent with single-cell microscopy studies (Gilbert 
et al. 1962; Morishima et al. 1962), the Xi replicated much later 
than the Xa. Furthermore, we found that the Xi did not follow any 
defined temporal replication pattern: The replication profiles of Xi 
chromosomes were inconsistent among females and between ex- 
perimental repeats, and did not show a continuous spatial pattern 
as measured by an autocorrelation analysis. A single exception was 
an 8-Mb region on the distal short arm of the X chromosome. 
Although the spatial and temporal resolution of our method might 
not be sufficient for the identification of small patches of ordered 
replication along the inactive X chromosome, these are expected 
to be limited to — at most — submegabase scales, and to a fraction of 
the time span within the already confined late replication of the 
Xi. The random pattern of replication of the Xi that we observe is 
to be distinguished from the previously described "stochastic" 
pattern of origin activation: While the latter still follows probabi- 
listic patterns that give rise to structured replication profiles in 
population measurements (and is thought to follow time con- 
straints in mammalian cells), Xi replication appears to be literally 
random, giving rise to population-level timing patterns that are 
not more structured than noise is. 

These results provide the first demonstration of random rep- 
lication in somatic human cells. Replication of the Xi was rapid, 



random, and associated with transcriptional quiescence, re- 
capitulating all of the properties of early embryonic replication in 
frogs and flies. Random, rapid replication could be explained by 
stochastic firing of a large number of closely spaced origins (Fig. 4), 
as observed in frog and fly embryos (Hyrien and Mechali 1993; 
Hyrien et al. 1995; Sasaki et al. 1999). The gradual loss of replica- 
tion structure as S phase progresses (Fig. 3) is consistent with ele- 
vated origin initiation rates at later times during S phase, as ob- 
served in single-molecule analyses in frogs (Herrick et al. 2000), 
human cells (Guilbaud et al. 2011), and fission yeast (Patel et al. 
2006), and as predicted by mathematical models (Rhind 2006). 
Such an increase in firing probabilities as S phase progresses — 
which at the extremity of very late S phase results in random 
replication — has been suggested to be a principle property of DNA 
replication in eukaryotes (Goldar et al. 2009). 

Our results suggest that DNA replication in eukaryotes can 
proceed very rapidly without a particular order. In contrast, in 
the presence of transcriptional activity cells utilize a structured 
replication program that comes at a substantial cost in replica- 
tion speed. This tradeoff suggests a need to coordinate replica- 
tion timing with transcription, or a role for replication timing 
in preserving the epigenetic information that is required to reg- 
ulate transcription. Such a role could be accomplished via dy- 
namic changes in replication-associated chromatin modifying 
activities as chromatin is reassembled on newly synthesized DNA 
(Fig. 4; Hiratani et al. 2009). It has previously been suggested 
that the replication timing of the Xi is important for the inher- 
itance of its epigenetic state (Chadwick and Willard 2003; Heard 
and Disteche 2006). DNA replication timing could potentially 
contribute to the transmission of epigenetic information across 
cell divisions, supporting the epigenetic maintenance of X 
chromosome inactivation and of chromatin states elsewhere in 
the genome. 

Methods 
Cell lines 

Cell lines used were from two father-mother-daughter trios, one 
with European ancestry (CEU) and one with West African an- 
cestry (YRI) (Supplemental Fig. SI). These cell lines show severe 
XCI skewing, in which the same copy of the X chromosome is 
inactive in >90% of the cells in a culture (McDaniell et al. 2010; 
Kucera et al. 2011). For both daughter cell lines, the paternally 
derived X chromosome was the clonally inactive copy (McDaniell 
et al. 2010; Kucera et al. 2011). We relied on the late replication 
of one X chromosome copy in the CEU mother (NA12892) as 
a marker for the Xi (Willard 1977) (an assignment also supported 
by the similarity to the Xi chromosomes in all other analyses we 
performed). 

Replication timing profiles 

Raw replication data was from Koren et al. (2012). For each in- 
dividual, we generated a reference sequence based on the hgl8 
version of the human genome with all SNPs in that individual 
(The 1000 Genomes Project Consortium 2010) masked, and then 
aligned the raw replication sequence reads to that reference. 

For each trio, we first identified all of the heterozygous SNPs 
in the daughter that were homozygous in at least one of the 
parents. We were thus able to assign each SNP allele to either 
a paternal or maternal origin. For the parents, heterozygous SNPs 
that were homozygous in the daughter were assigned as either 
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Figure 4. A model of replication structure regulation. Replication origins in regions that are transcriptionally active and have an open chromatin 
structure are activated at specified times (albeit with intercell variability in origin locations), giving rise to discrete domains that replicate in a synchronous 
manner. As S phase progresses, origin specification is gradually lost such that regions that replicate late use multiple origins are activated in a random 
manner, giving rise to an unstructured yet rapid replication pattern. In turn, dynamic changes in the activity of chromatin modifying enzymes over the 
course of S phase enable the preferential establishment of open or closed chromatin structures at particular genomic regions. The need to preserve 
chromatin structure during DNA replication may explain why cells sacrifice speed to achieve a specific temporal order of replication of transcriptionally 
active DNA. 



transmitted or nontransmitted. Information of crossover location 
in the CEU parents (Fan et al. 2011) was used to switch the chro- 
mosomal assignment of consecutive SNPs, resulting in fully 
phased chromosomes, yet with no parental assignment. 

For each individual with fully phased chromosomes (the 
two daughters and the CEU parents), sequence reads that over- 
lapped a phased SNP were used to generate a chromosomal copy 
specific replication timing profile as described in Koren et al. 
(2012). The number of X chromosome reads available in each cell 
line and each fraction are detailed in Supplemental Table SI. 
Non-chromosomal-copy-resolved ("diploid") data are from Koren 
etal. (2012). 

Correlations and autocorrelations 

The smoothed data were used to calculate correlations. Calcula- 
tions using the nonsmoothed S/Gl read coverage yielded similar 
results (data not shown). 

To compare the overall similarity of replication profiles along 
chromosomes, we devised a "correlation score" metric (Fig. 1) as 
follows: We (1) averaged all of the correlations between the "in- 
consistent" patterns, i.e., between maternal copies and between 
maternal copies and the diploid data (all of the blue blocks in the 
chromosome X correlation matrix in Fig. 2 A); (2) averaged all of 
the correlations between the "consistent" patterns, i.e., between 
paternal copies, between paternal and maternal copies, and be- 
tween paternal copies and the diploid data (all of the green blocks 
in the chromosome X correlation matrix in Fig. 2 A); (3) calculated 
the ratio of the correlations obtained in 1 and in 2. The correlation 
score was thereby calculated in 10-Mb windows centered every 1 
Mb along each chromosome. 

Allelic difference was calculated in 10-Mb windows centered 
every 1 Mb and averaged over the three females. Autocorrelations 
were calculated on the raw number of reads per SNP in the Gl and 
S phase data. For calculating autocorrelations along S phase, the 



ratio of the number of reads in the S phase to the number of reads 
in the Gl phase in each 100-bp window was extracted for the au- 
tosomes of each individual (including the repetition of NA19240). 
Windows within 1 Mb of a sequence gap and windows with values 
more than 10 times the median window value for each individual 
were removed from the data. The data was separated into eight 
fractions of increasing replication timing, with equal amounts of 
data in each fraction (hence, the fractions are not always separated 
by the same time difference; because autocorrelation values are 
sensitive to the amount of data analyzed, this was more reliable 
than separating the data to equal-time fractions), and autocorre- 
lations were calculated. For plotting, the average autocorrelations 
in the first 5-50 windows for each individual were used. Consistent 
results were obtained when analyzing the allele-specific data or the 
data defined by windows with a constant number of reads in the 
Gl phase (Koren et al. 2012), when separating the genome to 
a different number of fractions, and when using larger window 
sizes (data not shown). 

Features of the X chromosome 

Data regarding escape from X inactivation were extracted from 
Carrel and Willard (2005) and converted to fractional units by 
dividing the number of Xi hybrids that showed escape by the total 
number of hybrids assayed. Gene coordinates were converted to 
hgl8 coordinates. Only assayed genes were used. Locations of 
evolutionary strata were from Carrel and Willard (2005). 

Additional data sets 

Average and standard error of gene expression were based on RNA- 
seq data for CEU cell lines (Montgomery et al. 2010). DNase I hy- 
persensitive sites (DHS) were from Degner et al. (2012) and repre- 
sent the 5% most open-chromatin sites in the genomes of 77 
lymphoblastoid cell lines. 
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Data access 

Deep coverage data for selected X chromosome regions (Supple- 
mental Fig. S2) have been submitted to the NCBI Sequence Read 
Archive (SRA; http://www.ncbi.nlm.nih.gov/sra) under accession 
number SRP029958. 
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